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Abstract. Given any generating set of any pseudo-Anosov-containing sub- 
group of the mapping class group of a surface, we construct a pseudo-Anosov 
with word length bounded by a constant depending only on the surface. More 
generally, in any subgroup G we find an element / with the property that the 
minimal subsurface supporting a power of / is as large as possible for elements 
of G; the same constant bounds the word length of /. Along the way we find 
new examples of convex cocompact free subgroups of the mapping class group. 



1. Introduction 

Consider Mod(S'), the mapping class group of a surface S, and its action on the 
isotopy classes of simple closed curves on S. If all elements of a subgroup fix no 
common family of curves, then the group contains a pseudo-Anosov, that is, a single 
element which itself fixes no finite family of curves [Iya92 . This paper answers in 
the affirmative Fujiwara's question of whether one can always find a "short-word" 
pseudo-Anosov (Question 3.4 in |Fuj08| ). Where E generates the group G, let E- 
length denote the length of an element of G in the word metric induced by E. The 
affirmative statement is: 

Theorem 1.1. There exists a constant K — K{S) with the following property. 
Suppose G < Mod(S') is finitely generated by E and contains a pseudo-Anosov. 
Then G contains a pseudo-Anosov with Yi-length less than K . 

The proof provides an explicit construction of pseudo-Anosovs from arbitrary 
non-pseudo-Anosov elements. In fact, it addresses a broader question. Roughly 
speaking, a pseudo-Anosov requires the whole surface for its support. The other 
elements are called reducible, because they allow one to "reduce" the surface in 
question. That is, reducible mapping classes have powers which fix proper sub- 
surfaces and act trivially or as a pseudo-Anosov on those subsurfaces. Mosher has 
described a unifying approach to both types |Mosj , associating to a mapping class / 
what he calls its active subsurface A(f). For the sake of introduction one may think 
of A(f) as the smallest subsurface supporting some power of / (noting that these 
subsurfaces and thus their inclusion arc defined up to isotopy) and observe that 
/ is pseudo-Anosov exactly when A(f) = S. Then several foundational mapping 
class group theorems, including the Tits alternative for Mod(S') |Iva92i lMcC85| . 
and subgroup structure results from |BLM83j and 



what Mosher coined the Omnibus Subgroup Theoren given a group G < Mod(S') 
there exists / G G such that for all g € G, A(g) C A(f). Call such an / full-support 



Iva88j . elegantly derive from 
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■'"We should clarify that Mosher formulated the Omnibus Theorem to yield as corollaries certain 
results we need to prove Theorem |3.1| Mosher wanted streamlined proofs for transposing to the 
key of Out(F n ) (see |HM09I for progress). Content to use the old results as-is, we define active 
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for G. In this paper we actually prove the following, which includes Theorem |1.1| 
as a special case. 



Theorem |3.1| (Main Theorem) There exists a constant K = K(S) such that, for 
any finite subset £ C Mod($), one may find f full-support for (E) with "S-length 
less than K . 



The proof of Theorem |3.1| spells out short pseudo-Anosovs explicitly, with the 
following core construction, concerning special pairs of pure reducible mapping 
classes we will call sufficiently different. These are pure mapping classes a and b 
with pseudo-Anosov restrictions to proper subsurfaces A and B respectively, such 
that A and B together fill S, meaning that each curve on S has essential intersec- 
tion with either A or B. The proposition also identifies subgroups whose action on 
the curve complex gives a quasi-isometric embedding, so that they are convex co- 
compact Ham05 , KL08a] in the sense defined by Farb and Mosher |FM] in analogy 
to Kleinian groups. This last part is proven for interest, and is not necessary for 
the main theorem. 

Proposition 1.2. There exists a constant Q — Q(S) with the following property. 
Suppose a and b are sufficiently different pure reducible mapping classes. Then for 
any n,m > Q, every nontrivial element of G = (a n ,b m ) is pseudo-Anosov except 
those conjugate to powers of a n or b m . Furthermore, G is a rank two free group, 
and all of its finitely generated all-pseudo- Anosov subgroups are convex cocompact. 

In [ManlOj . the author considers a more general condition for pairs of pure re- 
ducible mapping classes, and finds Q such that G as above is a rank two free 
group, but need not contain pseudo-Anosovs (and therefore need not be convex 
cocompact). A more relevant comparison is Thurston's theorem providing the first 
concrete examples of pseudo-Anosovs |Thu88j (see also |Pen88] 1. He proved that 
if a and b are Dehn twists about filling curves, then one can find an affine struc- 
ture on S inducing an embedding of (a, b) into PSL(2, R) under which hyperbolic 
elements of PSL(2,R) correspond to pseudo-Anosovs in (a, b). In particular, every 
nontrivial element of the free semigroup generated by a and b~ l is pseudo-Anosov. 
More recently, Hamidi-Tehrani [HT02 classified all subgroups generated by a pair 
of positive Dehn multi-twists. In particular, if a and j3 are multicurves whose union 
fills S, and a and b are compositions of positive powers of Dehn twists about com- 
ponents of a and /3 respectively, then except for finitely many pairs n, m, (a™, b m ) is 
a rank-two free group whose only reducible elements are those conjugate to powers 



of a or b (see also }Ish96j ). In a different light, one can consider Proposition 1.2 
companion to a theorem of Fujiwara that generates convex cocompact free groups 
using bounded powers of independent pseudo-Anosovs; this theorem appears in 
Section [3.3. 21 as Theorem 13.21 

Acknowledgments. Atop my lengthy debt of gratitude sit Dick Canary and 
Juan Souto for everything entailed in advising me to PhD (out of which work this 
paper emerged) and Chris Leininger for so many tools of the trade. 

2. Preliminaries 

This section consists of five parts. The first presents basic definitions around the 
mapping class group and curve complex; the second relates the curve complexes of 



subsurfaces so that the Omnibus Theorem rephrases a known theorem (see Section [2.4^ ; meanwhile 
we reap the benefits of its perspective and terminology. 
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a surface and its subsurfaces. Section |2.3| presents a different view of subsurfaces, 
which facilitates Section [2~4| on the case for understanding reducible mapping classes 
as subsurface pseudo-Anosovs. Finally, Section |2.5| recalls powerful curve complex 
tools with which we rephrase the classification of elements of Mod(S). 

2.1. Mapping class group and curve complex. Throughout, we consider only 
oriented surfaces whose genus g and number of boundary components p are finite. 
Define the complexity £(S) of a surface S by £ = 3g + p — 3. We neglect the 
case where £ is —2 or zero, which means S is a disk, a closed torus, or a pair 
of pants, because these are never subsurfaces of interest, as explained in Section 
|2.2| Annuli, for which £ = — 1, feature throughout, but primarily as subsurfaces of 
higher-complexity surfaces. Let us first assume £ > 1, and address the annulus case 
after. Note that our definitions will not distinguish between boundary components 
and punctures, except on an annulus. One may find in |FMj a discussion on variant 
definitions of the mapping class group. 

Given a surface S, its mapping class group Mod(S') is the discrete group of 
orientation-preserving homcomorphisms from S to itself that setwise fix compo- 
nents of dS. Much (arguably, everything) about Mod(S') appears in its action on 
the isotopy classes of those simple closed curves on S that are essential: neither 
homotopically trivial nor boundary-parallel. Let us call these classes curves for 
short. Pseudo-Anosov mapping classes are those that fix no finite family of curves; 
necessarily these have infinite order. Among the rest, we distinguish those that 
have finite order, and call the remaining reducible. 

The intersection of two curves a and /3, written i(a,/3), is the minimal number 
of points in a' PI /?', where a' ranges over all representatives of the isotopy class 
denoted by a and likewise /3' ranges over representatives of (3. Often, we will use the 
same notation for a curve as an isotopy class and for a representative path on the 
surface. For specificity and guaranteed minimal intersections, one may represent 
all curves by closed geodesies with respect to any pre-ordained hyperbolic metric 
on S. Two curves fill a surface if every curve of that surface intersects at least one 
of them. Disjoint curves have zero intersection. All of these definitions extend to 
multicurves, our term for sets of pairwise disjoint curves. 

Now let us upgrade the set action of Mod(S') on curves to a simplicial action 
on the curve complex of S, denoted C(S). Because C(S) is a flag complex, its 
data reside entirely in the one-skeleton Ci(S): higher-dimensional simplices appear 
whenever the low-dimensional simplices allow it. Thus it suffices to consider only 
the graph Cx(S), although we usually refer to the full complex out of habit. 

Curves on S comprise the vertex set Cq(S), and an intersection rule determines 
the edges of Ci(5). For a surface with complexity £ > 1, edges join vertices rep- 
resenting disjoint curves. Therefore n-simplices correspond to multicurves with n 
distinct components, and £ gives the dimension of C(S). When £ = 1, S is a punc- 
tured torus or four-punctured sphere. Because on these any two distinct curves 
intersect, we modify the previous definition so that edges join vertices representing 
curves which intersect minimally for distinct curves on that surface — that is, once 
for the punctured torus and twice for the four-punctured sphere. In both cases 
C(S) corresponds to the Farey tesellation of the upper half plane (vertices sit on 
rationals corresponding to slopes of curves on the torus). 

Give Ci(S) the path metric where edges have unit length; this extends simpli- 
cially to the full complex. For any two curves a and j3, let d(a,/3) denote their 
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distance in Ci(S). Immediately one may observe that C(S) is locally infinite. It 
is not obvious that C(S) is connected, but the proof is elementary |Har81| . Fur- 
thermore, it has infinite diameter |MM99j . A deep theorem of Masur and Minsky 
states that C(S) is (5-hyperbolic, meaning that for some 5, every edge of any geodesic 
triangle lives in the ^-neighborhood of the other two edges [MM99 . 

Now suppose S is the annulus S 1 x [0,1]. We modify the definition of the 
mapping class group to require that homeomorphisms and isotopy fix dS pointwise. 
Parameterizing S 1 by angle 9 and S by (9, t), the Dehn twist on S maps (9, t) to (9+ 
2wt, t). The cyclic group generated by this twist is the entire mapping class group of 
the annulus. In this group, let us consider all non-trivial elements pseudo-Anosov, 



for reasons clarified in Section 2.5 To define C(S), we contend with the fact that an 
annulus contains no essential curves. Instead, each vertex of Cq(S) corresponds to 
the isotopy class of an arc connecting the two boundary components, again requiring 
isotopy fix boundary pointwise. Edges connect vertices that represent arcs with 
disjoint interiors. Although this complex is locally uncountable, it is not difficult to 
understand: the distance between two distinct vertices equals one plus the minimal 
number of interior points at which their representative arcs intersect. Again, C(S) 
is connected, infinite diameter, and 5-hyperbolic — in fact, it is quasi-isometric to 
the real line, and Mod(5) acts on it by translation. Note that our "curve complex" 
for the annulus is more accurately called an arc complex. Generally, we aim to 
minimize the distinction between annular subsurfaces and subsurfaces with £ > 1; 
for extended treatment of curve complexes and arc complexes, see }MM00| . 

2.2. Subsurface projection. Let us relate the curve complex of S to that of 5", 
where £(5) > 1 and S' is an "interesting" subsurface of S. Here, a subsurface is 
defined only up to isotopy, and assumed essential, meaning its boundary curves 
are either essential in S or shared with dS. This rules out the disk. We also 
disregard pants, which have trivial mapping class group. For the remainder of 



Section 2.2 we assume S' is connected; furthermore, either £(<S") > 1 or S' is an 
annular neighborhood of a curve in S. 

For ease of exposition and the convenience of considering dS 1 a multicurve in S, 
let us make a convention that OS' refers only to those boundary curves essential 
in S. In later sections we consider multiple, possibly nested subsurfaces, but these 
are always implicitly or explicitly contained in some largest surface S. 

Except when S' is an annulus, it is clear one can embed C(S'), or at least its 
vertex set, in C(S), but we seek a map in the opposite direction. One can associate 
curves on the surface to curves on a subsurface via subsurface projection, a notion 
appearing in [ Iva88 [ IIva92] , expanded in MMQO] , and recapitulated here. In what 
follows, we define the projection map ns> from Cq(S) to the powerset V(Co{S')). 
To start, represent 7 G Cq(S) by a curve minimally intersecting dS'. 

First suppose S' is not an annulus. If dS') = 0, then either 7 C S", and we 
let ns'i'j) — {7}, or 7 misses 5", and we let 7^(7) be the empty set. Otherwise, 7 
intersects dS' . For each arc a of jCiS', take the boundary of a regular neighborhood 
of aUdS' and exclude the component curves which are not essential in S' . Because 
5" is neither pants nor annulus, some curves remain. Let these comprise ^5/(7). 

In the case that S' is an annulus, when i(j,dS') — we let 7^(7) be the 
empty set. When 7 intersects dS', one expects ^5/(7) to consist of the arcs of 7 
intersecting 5". However, ambiguity arises because curves such as 7 and dS' are 
defined up to 9S'-fixing isotopy, but vertices oiC(S') represent arcs up to <9S"-fixing 
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isotopy. To remedy this, give S a hyperbolic metric. Consider the cover of S 
corresponding to the fundamental group of S' embedded in that of S. This cover is 
a hyperbolic annulus endowed with a canonical "boundary at infinity" coming from 
the boundary of two-dimensional hyperbolic space (the unit circle, if one uses the 
Poincare disk model). Name the closed annulus A, and let C(A) stand in for C(S'). 
Each 7 <= Cq (S) has a geodesic representative which lifts to A, and if 7 intersects 
5', some of the lifts connect the boundary components. Let these comprise 715(7). 

We will frequently say something like 7 projects to S' to mean ns> (7) is not the 
empty set. Where X is some collection of curves {7^}, let irs>(X) = Ui 7r S'(7i)- If 
tts'(X) is not empty, let diams'(X) denote its diameter in C(S'); omit the subscript 
S' to mean diameter of X in C(S) itself. We are content with a map from C(S) 
to subsets of C(S'), rather than C(S') directly, precisely because multicurves have 
bounded-diameter projection: 

Lemma 2.1. Suppose 7 is a multicurve of S, and S' a subsurface. If '7 projects to 
S', diam s <(7r s <(7)) < 2. 

This fact appears as Lemma 2.3 in [MMOOj . Note that complexity £ in jMMOOj 
differs from our definition by three. Also, there is a minor error in Lemma 2.3, 
corrected in IMin03] (pg. 28) , which we avoid by stating the lemma for multicurves 
rather than simplices of C(S) — these do not coincide if £(S) = 1. 

When multicurves a and /3 both project to S' , define their projection distance 
ds'(a,/3) as diams' (7^ (a) U tts' (/?))• This is a "distance" in that it satisfies the 
triangle inequality and symmetry, but it need not discern curves: Lemma |2.1| im- 
plies that disjoint multicurves have a projection distance of at most two, when 
defined. In the other extreme, one easily finds examples of curves close in C(S) 
with large projection distance in C(S') — a small illustration of the great wealth of 
structure that opens up when one considers not only C(S) but curve complexes of 
all subsurfaces (see, for example, [MMOOj ). 



2.3. Cut-coded subsurfaces and domains. We now present an alternate defi- 
nition of subsurface that ducks the nuisance of disconnected subsurfaces containing 
annular components parallel to the boundary of other components. These are the 
only subsurfaces capable of being mutually nested (via isotopy) yet not topologically 
equivalent. Our technical antidote may seem tedious, but as an upside, it trans- 
lates our current notion of a subsurface into an object encoded unambiguously by 
curves, a recurrent theme of this paper. Moreover, the new viewpoint facilitates the 
next section's definition of active subsurfaces, based on Ivanov's work on Mod(S) 
subgroups. The efficient reader is welcome to skim the definition, taking note of 
Lemmas 2.2 and 2.3 and rely on Figure [l] for intuition. 

A cut-coded subsurface of S consists of two pieces of information: (1) a multicurve 
7 and (2) a partition of the non-pants components of >S\7 into two sets: excluded 
and included components. 

Let us clarify what subsurface S' this data is meant to describe. Label the 
component curves of 7 by 71 , . . . 7„ , and the included components A\ , . . . A m . The 
multicurve 7 contains each component of dAi, but some jj may not be a boundary 
curve for any Ai. Let Nj be regular neighborhoods of those belonging to no dAi. 
Then S' consists of the union of the A*, seen as subsurfaces, and the implied annuli 
Nj. Ignoring multiplicity, OS 1 and 7 are the same multicurve. 
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Figure 1. How to define a cut-coded subsurface. From left to right: (1) 
Set multicurve. (2) Choose included components (shaded). (3) Domains 
(shaded) correspond to included components and implied annuli. 



Call the Ai component domains and the Nj annular domains of S'. We use 
domain to refer to either kind, or any subsurface that may appear as a domain — in 
other words, any connected (essential) non-pants subsurface. 

Not all subsurfaces are cut-coded, as the latter never include pants, parallel 
annuli, or annuli parallel to the boundary of a component domain. Cut-coded 
subsurfaces are exactly those appearing as active subsurfaces of mapping classes, 
which the next section details. 

The cut-coded subsurfaces of S admit a partial order c detected by subsurface 
projection. Say a subsurface A nests in B if A may be isotoped into B. Now 
suppose A and B are cut-coded subsurfaces with domains Ai and Bj respectively. 
Say A C B if every Aj nests in some Bj. Transitivity and refiexivity of C are 
obvious, but antisymmetry is relatively special. If A c B and B C A, one can 
check A and B are given by the same data as cut-coded subsurfaces, ultimately 
because they never contain an annulus parallel to another domain. In contrast, 
general disconnected subsurfaces can be mutually nested but not isotopic. 

Call two subsurfaces disjoint if they may be isotoped apart, and overlapping if 
they arc neither disjoint nor nested; note that overlapping subsurfaces are distinct 
by definition. Projection determines relations between domains: 

Lemma 2.2. Suppose A and B are domains in S . 

(i) TTA{dB) is empty if and only if A nests in B or its complement. 

(ii) A and B overlap if and only if dA projects to B and dB to A. 

(iii) If dA projects to B but TTA{dB) is empty, A nests in B. 

(iv) If7TA(dB) and ttb(9A) are both empty, A and B are equal or disjoint. 

Proof. Statements (ii) - (iv) derive from (i). The forward implication of (i) requires 
connectedness of A: one may choose curves a± and a-i that fill A, so that A is 
represented by a regular neighborhood of a\ U ai with disks and annuli added 
to fill in homotopically trivial or 95-parallel boundary components. If tta(9B) is 
empty, then dB is disjoint from aiU«2. Therefore the two curves, and consequently 
A itself, can be isotoped either entirely inside B or into its complement. The reverse 
implication is self-evident. □ 

Consider a strictly increasing sequence A 1 C A 2 C • • ■ C A m of cut-coded 
subsurfaces of S. Let a be a maximal multicurve (i.e., a pants decomposition) 
including dA* for all i. Each step of the sequence corresponds to some curve of a 
appearing for the first time as a component of dAk or an essential curve in Ak , so 
the maximum length of the sequence is twice the number of components of a. We 
have just observed: 



SHORT-WORD PSEUDO-ANOSOVS 



7 



Lemma 2.3. If £(5) > 1, a strictly increasing sequence of nonempty cut-coded 
subsurfaces of S has length at most 2£. 

It is easy to construct sequences realizing the upper bound. 

2.4. Active subsurfaces and the Omnibus Theorem. Thurston originally 
classified elements of Mod(S) by their action on the space of projective measured 
foliations, a piecewise linear space obtained by completing and projectivizing the 
space of weighted curves on S. He defined a pseudo-Anosov diffeomorphism as one 
that fixes a pair of transverse projective measured foliations, and proved that any 
pseudo-Anosov mapping class (i.e., any mapping class fixing no multicurve) has a 
representative pseudo-Anosov diffeomorphism. In fact he proved more: 

Theorem 2.4 (Thurston, |Thu88j . [Poe79] ) . Every element f ofMod(S) has a rep- 
resentative diffeomorphism F such that, after cutting S along some 1-dimensional 
submanifold C , F restricts to a pseudo-Anosov diffeomorphism on the union of 
some components of S\C, and has finite-order on the union of the rest. 

Birman-Lubotzky-McCarthy proved that C represents a unique isotopy class, 
and they gave a simple way to find it [BLM83J. We call this isotopy class the 
canonical reduction multicurve of /. Ivanov generalized to subgroups both the 
classification and the canonical cutting method [Iva92j. We build the notion of an 
active subsurface according to this last, largest perspective. Let us emphasize that, 
while we aim to paint a picture that seems perfectly natural, the validity of the 
definitions in this section depends on multiple lemmas and theorems from |Iva92j . 
which serves as reference for assertions presented without proof. 

Ivanov identified a congenial property of many mapping classes. Call a mapping 
class pure if, in the theorem above, it restricts to either the identity or a pseudo- 
Anosov on each component (in particular, no components are permuted). Because 
one may take the empty set for C in Theorem |2.4[ any pseudo-Anosov mapping 
classes is pure. The property of being pure is most useful for reducible mapping 
classes, because cutting along C "reduces" S to a collection of smaller subsurfaces. 
The point in what follows is to formalize this procedure. 

Call a subgroup pure if it consists entirely of pure mapping classes. Nontrivial 
pure mapping classes have infinite order, so they let us ignore the complications 
of torsion. Fortunately, the mapping class group has finite-index pure subgroups 
(Theorem 3, |Iva92j). In particular, one can take the kernel of the homomorphism 
Mod(S) -» Aut( J ffi(5,Z/3Z)), induced by the action of Mod(S) on homology. 
Name this subgroup T(S). 

Let us first define the canonical reduction multicurve cr(G) and active subsurface 
A(G) of a pure subgroup G < Mod(S'). The multicurve cr(G) consists of all curves 
7 such that (i) G fixes 7, and (ii) if some curve (3 intersects 7, then G does not fix (3. 
The active subsurface A(G) is cut-coded with multicurve 7. Included components 
correspond to those on which some element of G "acts pseudo-Anosov." Specifically, 
for each component Q of S\a(G), we have a homomorphism pg : G — > Mod(Q) such 
that PQ{g) is the mapping class of F\q, where F is a homeomorphism representing 
g. Let Gq denote the image pq(G). Because G is pure, for each component Q, the 
image Gq either contains a pseudo-Anosov, in which case Q is included, or is the 
trivial group, in which case Q is excluded (Theorem 7.16, [Iva92] ) . 

It follows that the annular domains of -4(G) correspond to neighborhoods of 
those 7 £ u(G) that only bound components Q for which Gq is trivial. Because by 
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definition 7 is not superfluous, some g € G restricts, in a neighborhood of 7, to a 
power of a Dehn twist about 7. Section [23] justifies why we consider Dehn twists 
annulus pseudo-Anosovs. 

If G is an arbitrary subgroup, choose a finite-index pure subgroup G' and define 
<r(G) = cr(G') and -4(G) = A(G'). Any choice of G' gives the same multicurve, 
and one can always take G' = G n G acts on S\a(G), although its elements 

may permute the components. For each component Q, one can define pq on the 
finite-index subgroup of G stabilizing Q. Each image Gq is finite or contains a 
pseudo-Anosov, and cr(G) is the minimal multicurve with this property (Theorem 
7.16, |Iva92] '). It follows that an infinite subgroup G contains a pseudo-Anosov if 
and only if c(G) is empty; let us call such a subgroup irreducible. 

For any mapping class let cr(g) = o~((g)) and A(g) = A{{g)). In this terminol- 
ogy, we recast Ivanov's Theorem 6.3 [Iva92 as follows: 

Theorem 2.5. For any G < Mod(S) there exists f e G such that A(f) = A{G). 



After Lemma 2.7 below, we can recognize Theorem 2.5 as Mosher's Omnibus 
Subgroup Theorem. We do not require this theorem for our proofs, only the validity 
of the definitions on which it is based. In fact, one can prove Theorem |1.1| with 
no mention of active subsurfaces for non-pure subgroups. However, its definition 
allows us to state the Main Theorem, Theore m |3.1| which takes the more general, 
perhaps more useful point of view of Theorem |2.5| adding the benefit of an / with 
bounded word length. 

Facts about active subsurfaces occupy the remainder of this section. Directly 
from definitions, we derive Lemma 12.61 below. With this we obtain Lemmas 12.71 - 



2.9 which enable our proof of the main theorem. Let us refer to domains of A(G) 
or A(g) as domains of G or g respectively. Say G moves the curve 7 € Co (50 if 
some element of G does not fix 7. 

Lemma 2.6. Suppose H and G are pure subgroups ofMod(S) and 7 G Cq(S). 

(a) H moves 7 if and only if '7 projects to some domain of H. 

(b) H and G fix and move the same curves if and only if A(H) = A(G). 

Lemma 2.7. If H < G then A(H) C A{G). 

Proof. We may replace H and G by the pure subgroups obtained by intersecting 
with T(S). Suppose A is a domain of G and B a domain of H . Because G, and 



thus H, fixes dA, 7Tb(<94) is empty. Lemma 2.2 guarantees that B nests in A or its 
complement. Now suppose B is in the complement of every domain of G. Then any 
curve essential in B is fixed by G, thus by H . This contradicts that B is a domain 
of H, unless B has no essential curves. Thus B is an annulus and dB consists of 
(two copies of) a single curve j3. Any 7 intersecting /3 is moved by H, hence by 
G. But because G fixes 7, this means 7 € c(G). Because B is the annulus around 
7, B isotopes into -4(G). Thus every domain of H nests in A(G), which means 
A{H) c A(G). □ 

In general, let A{X\, X2, ■ ■ .) denote the active subsurface of the group generated 
by Xi, X2, . . . , where Xi may be either elements or subgroups of a mapping class 
group. 

Lemma 2.8. Suppose H and G are pure elements or subgroups ofMod(S). Then 
A{H) C A{G) if and only if A(G) = A{G, H). 
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Figure 2. How to define the active subsurface of a group G. (1) Cut 
along canonical reduction multicurve. (2) Include components (shaded) 
on which G induces an irreducible subgroup. (3) Every element of G 
has a power supported on included components and implied annuli, by 
Lemma 2.7 



By Theorem 2.5 some element has exactly this support. 



Proof. Observe that A{H) C -4(G) implies that any curve fixed by G is fixed by 



H, and hence G and (G, H) fix the same curves. Apply Lemma 2.6 to G < (G, H 



for the forward direction, and Lemma 2.7 to H < (G,H) for the reverse. □ 



Lemma 2.9. Let G be a pure subgroup ofMod(S) generated by {H, Hi, . . . Hk, . . . }, 
where Hk are groups or elements. Either A(H) — A{G) or A{H) ^ A(H, Hk) for 
some k. 

Proof. Suppose A(H) = A(H,H k ) for all k. Let K, = (H, H u . . . H % ) . Using 
induction and Lem ma |2.8| one can show A(H) = A(Ki) for all i. Because the Ki 

one 



exhaust G, Lemma [2.7| implies A(g) C A{H) for all g € G. By Lemma 2.6 
knows any curve fixed by H has empty projection to all A(g), so H and G fix and 
move the same curves, and consequently A(H) = A(G). □ 

2.5. Machinations in the curve complex. This section collects several impor- 
tant curve complex results that provide the foundation for our proofs. Together, 
these results link mapping class behavior with curve complex geometry. Following 
our theme of interpreting Mod(S) via C(S), we note that these results also lead to 
a C(S')-centric Mod(S) classification. 

By definition, reducible mapping classes and finite-order mapping classes have 
bounded orbits in C(S). That pseudo-Anosovs have infinite-diameter orbits is corol- 
lary to a theorem of Masur and Minsky: 

Theorem 2.10 (Minimal translation of pseudo-Anosovs MM99J). There exists 
c = c(S) > such that, for any pseudo-Anosov g € Mod(S'), vertex 7 G Cq(S), and 
nonzero integer n, 

d s (g n ^) n )>c\n\. 

This gives us a way to recognize pseudo-Anosovs. It also suggests an alternate 
classification of elements on Mod(S'), by whether they have finite, finite-diameter, 
or infinite-diameter orbits in C(S). If one takes that classification as a starting 
point, Dehn twists clearly qualify as pseudo-Anosovs for the annulus mapping class 
group. 

For pure mapping classes, one may even refine this classification. Suppose g G 
Mod(5) is pure and has a domain Y. The observation that, for all 7 G Cq(S), 
^(9(7)) = 9(^y(i)), yields: 

Corollary 2.11 (Minimal translation on subsurfaces). There exists c = c(S) > 
such that, for any pure element g 6 Mod(5) with domain Y , vertex 7 € Cq(S) such 
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that 7ry(7) 7^ 7 and nonzero integer n, 

d Y {g n (n),l)>c\n\. 

Thus every nontrivial pure element g has infinite-diameter orbits in the curve 
complexes of its domains. Moreover, an orbit of g in the curve complex of one 
of its domains projects to a bounded diameter subset of the curve complex of any 
subsurface properly nested in that domain. This is because, if some g-orbit projects 
to a domain, most of its curves approximate the limiting laminations of g which 
fill that domain. Projection to the properly nested subsurface will not distinguish 
between the approximating curves, so the orbit looks roughly constant at large- 
magnitude powers of g. This is the heuristic behind another theorem of Masur 
and Minsky, which along with Corollary |2.11 is crucial to our core construction in 
Section 13.31 



Theorem 2.12 (Bounded geodesic image IMM00| ). Let Y be a proper domain of 
S. Let Q be a geodesic in C(S) whose vertices each project to Y . Then there is a 
constant M = M(S) such that 

diam Y (£) < M. 

This theorem enabled Behrstock to obtain Lemma [2. 13| below, which Chapter [3] 
frequently employs. Here we give the elementary proof, with constructive constants, 
by Chris Leininger. Most of it appeared previously in ManlO ; this version adds 
the possibility of annular domains. 

Lemma 2.13 (Behrstock [ Beh06j ). For any pair of overlapping domains Y and Z 
and any multicurve x projecting to both, 

dy (x, BZ) > 10 d z (x, BY) < 4 

Proof. First we gather the facts that prove the lemma when neither Y nor Z is an 
annulus. Suppose S' is a subsurface of S and £(>S), £(<S") > 1. Let uq and vq be 
curves on S which minimally intersect S' in sets of arcs. Suppose a u is one these 
arcs for uq, and u a component of the boundary of a neighborhood of a u U 8S'; 
suppose a v and v play the same role for u . Then u G t^s\ u q) an d v <G tts'( v o)- 
Define intersection number of arcs to be minimal over isotopy fixing the boundary 
setwise but not necessarily pointwise. One has: 

(1) If i(a u ,a v ) = 0, then d s >{u ,v ) < 4 

(2) If i(u,v) > 0, then i(u,v) > 2( d s'('^«)-2)/2 

(3) i(u, v) < 2 + 4 • i(a u , a v ) 



Statement (1) follows from the proof of Lemma 2.1 (Lemma 2.3 in MM00] ) . 
Straightforward induction proves (2), which Hempel records as Lemma 2.1 in 
|Hem01| . Fact (3) is the observation that essential curves from the regular neighbor- 
hoods of a u U dS' and a v U OS' intersect at most four times near every intersection 
of a u and a v , plus at most two more times near dS'. 

Now assume £(Y),£(Z) > 1. Because dy(x,dZ) > 10 > 2, the diameter is 
realized by curves u £ tty(x),v £ ny{dZ) such that, by (2), i(u,v) > 2 4 = 16. By 
the definition of iry , these u and v come from arcs a u C x PI Y and a v C dZ n Y 
respectively. By (3), i(a u ,a v ) > (16 — 2)/4 > 3. Thus a u is an arc of x intersected 
thrice by an arc a v of dZ, within the subsurface Y. Observe that one of the 
segments of a u between points of intersection must lie within Z. This segment is 
an arc a x of x disjoint from arcs of BY in Z. Fact (1) implies dz{x, BY) < 4. 
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Figure 3. Key point of Behrstock's lemma. Because x and dZ have 
large projection distance in Y, one can find an arc of x intersecting an 
arc of dZ three times. Extra-fine dashed lines represent where curves 
run behind the surface. 



The main idea of this proof works for annular domains after a few more relevant 
facts. Endow S with a hyperbolic metric and let A be an embedded annulus with 
geodesic core curve a; let A be the corresponding annular cover of S. Let u and v 
be geodesic curves in S with lifts u and v traversing the core curve of A. We have 
already mentioned 

(4) d A (u,v) = i(u,v) + 1. 

Let a be the unique lift of a corresponding to the core curve of A, and let Si and 
a<2 be the first lifts of a intersecting u on each side. On the sides of at opposite a, 
A is isometric to its pre- image in the universal cover S = H 2 . Because geodesies 
intersect only once in H 2 , 

(5) at most two of the intersections of u and v occur outside the open segment 
of u between the Sj . 

Now we can retrace the proof above, augmenting it to address the possibility 
that Y or Z is an annulus. Under some hyperbolic metric, x, dY, and dZ have 
geodesic representatives. Suppose Y is an annulus with geodesic core curve y and 
Y is the corresponding annular cover of S. If dy(x,dZ) > 10, (4) implies that 
some lifts x and dZ in Y intersect at least nine times. By (5), at least three of 
these intersections occur on an open segment of x between consecutive lifts of y, 
and a neighborhood of this segment embeds in S. As before, one finds an arc of 
x intersecting dZ at least three times in a neighborhood disjoint from dY. If Z is 
not an annulus, one repeats the conclusion of the first argument. 

On the other hand, suppose that Z is an annulus with geodesic core curve z, 
and Y is any domain. The arguments thus far tell us one has an arc of x thrice 
intersecting z in a neighborhood of that arc disjoint from dY. In the annular cover 
corresponding to Z, the three intersections correspond to a lift x of x intersecting 
the closed lift z of z and adjacent lifts zi, Z2 on each side. Any lift of dY cannot 
intersect x between these intersections, so (5) implies i(x, dY) < 2. Fact (4) implies 
dz(x,dy)<3. □ 
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Figure 4. Behrstock's lemma for the annulus. If x and z intersect three 
times with no intersections of dY in between, then in the annular cover 
corresponding to z, x can intersect dY no more than twice. Extra- fine 
dashed lines represent parts of curves on the far side of the annulus. 



Remark. Behrstock's lemma implies that C(S)-orbits of a mapping class g have 
bounded projection to subsurfaces that overlap with domains of g. Using stronger 
results, Theorem |2.12| in particular, one can prove that the g-orbit of any 7 £ 
C(S) projects to an unbounded set in the curve complex of 5" if and only if the 
orbit projects to S' and S' is a domain of g. Thus one obtains a refined, curve- 
complex-based classification of elements of Mod(S'), by letting the phrase "g is 
pseudo-Anosov on S'" mean that the <7-orbit of some 7 £ C(S) has infinite-diameter 
projection to C(S'). 



3. The recipe 

We begin at the end, proving the Main Theorem, modulo two propositions, in 
Section [3.1| There, we reduce the proof to the problem of generating a short-word 
full-support mapping class when the generating set has only two pure elements; 
even then, we must simultaneously accommodate multiple scenarios happening on 
different, non-interacting subsurfaces. The actual construction of pseudo-Anosovs 
is left to Section |3.3[ where in fact we identify all pseudo-Anosov elements in any 
group generated by two "sufficiently different" pure reducible mapping classes. That 
is, we prove Proposition 1 1 .2| of the introduction, which gives more than is needed for 
the main theorems. In general one is not so lucky as to start out with the condition 



on generators required for Proposition 1.2 to apply. Thus we also need a recipe for 



writing a sufficiently different pair of mapping classes, given an arbitrary pair of 
pure reducible mapping classes generating an irreducible subgroup. Proposition [3]4] 
fills that need; its proof occupies Section |3.2| 



3.1. Writing the short word. Restated in the terminology introduced in Section 
|2.4| our goal is to prove the following: 

Theorem 3.1 (Main Theorem). There exists a constant K — K(S) with the prop- 
erty that, for any subset £ C Mod(S'), there exists f £ (£) with Y,-length less than 
K, such thatA(f) = A(E). 
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First, let us narrow our starting point. By definition, the finite-index pure sub- 
group H = (S) n r(5) has the same active subsurface as (S). Suppose the index 
of H in (£) is d. Lemma 3.4 of Shalen and Wagreich [SW92] provides a generating 
set for H of words less than 2d — 1 in length according to the original generating 
set. Although they state the lemma for finite generating sets, nothing prevents 
the proof from applying to a general group. Therefore, if we find a full-support 
mapping class in H whose word length is less than li in the new generating set, its 
word length is less than l\(2d — 1) in the original generating set. Recall that T(5) 
is the kernel of the Mod(5) action on homology with coefficients in Z/3Z; its index, 
|Aut(i?i(S, Z/3Z))|, gives an upper bound for d. 

Let £' = {hi} be the new generating set for H. Renumbering as necessary, 



Lemmas |2.3| and 2.9 provide a sequence A(h\) C A{h 1 ,h 2 ) C A(h\, h%, hs) ■ ■ ■ 
terminating in at most 2£(S) steps at A{h\, h^, . . . , hk) — A{G). Let pi = hi 
and suppose for each subgroup (pi-±,hi) we can spell a full-support element pi 
(i.e. A(pi) — A{jPi-i,hi)) with word length less than li in the generating set 



{pi—i, hi}. By Lemma 2.7 A(hi) C A(pi); inductively assuming that A(pi-x) 



A{hi, . . . , hi-i), we also know A(hj) C A(pi-i) C A(pi) for all j < i. Then 



Lemmas 2.7 - 2.9 tell us A{pi) = A{h\, . . . , hi), for all i. In particular, p^ is 



full-support for H and has S'-length less than 1%, where k < 2£. 



We have reduced Theorem 3.1 to the case where A consists of a pair of pure 
mapping classes a and b. Let G = (a, 6). Finding an element of full-support for G 
is equivalent to finding a pseudo-Anosov on A(G), although we contend with the 
fact that A(G) may well be disconnected. On every domain of A(G), one of the 
following possibilities occurs: 

(i) a and b are pseudo-Anosov. 

(ii) a and b are both reducible. 

(hi) One of a and b is pseudo-Anosov and the other is reducible. 

To handle the first possibility, we quote a theorem of Fujiwara. An earlier in- 
carnation of this theorem inspired the short-word question in the first place. Call 
a pair of pseudo-Anosovs independent if all pairs of nontrivial powers fail to com- 
mute. In torsion-free groups such as those we consider, either two pseudo-Anosovs 
generate a cyclic subgroup, or they are independent (see the proof of Theorem 5.12 
in |Iva92^. In the latter case, Fujiwara's theorem applies: 

Theorem 3.2 (Fujiwara Fuj09 , partial version). There exists a constant L = 
L(S) with the following property. Suppose a, b <E Mod(5) are independent pseudo- 
Anosovs. Then for any n,m > L, (a n ,b m ) is an all-pseudo-Anosov rank-two free 
group. 



Case (ii) relies on two results. The first, Proposition 3.3 below, is a subset of 



Proposition |1.2| from the introduction; both are proved in Section |3.3| Define Q = 



Q(S) = max{3, (2M + 4) /c} where M = M (S) is the constant from Theorem 2.12 



and c = c(S) is the minimal translation length from Corollary 2.11 Recall that 
a pair of pure reducible mapping classes a and b are sufficiently different if some 
A £ A{a) and B E A(b) together fill S. 

Proposition 3.3. For any m,n> Q and sufficiently different pure mapping classes 
a and b, the group (a m , b n ) is a rank-two free group and its elements are each either 
pseudo-Anosov or conjugate to a power of a or b. 
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Proposition |3.3| would be irrelevant if not for Proposition |3.4| whose proof we 



leave to Section 3.2 Let £ = £(5) and c = c(S) as above. The following proposition 
works for any domain S' C S. 

Proposition 3.4. Suppose a and b are pure reducible mapping classes in Mod(S") 
and (a, b) is irreducible. For any n > £ — 1 and k > 20/c, the words 

ai = {b k a k ) n b k ■ a ■ ((b k a k ) n b k y 1 and b x = (a k b k ) n a k ■ b ■ {(a k b k ) n a k )- 1 

are sufficiently different pure reducible mapping classes. 

We handle case (iii) by converting it to either case (i) or (ii), so with the three 
results above we may finish the proof. Recall we face the situation where a and b 
are pure reducible mapping classes generating a group G with possibly disconnected 
active subsurface. Our task is to write a word in a and b that induces a pseudo- 
Anosov on every domain of G. 

Let ai and &i be as in Proposition |3.4| Because these are simply conjugates of 
a and b, they fulfill the same possibilities (i)-(iii) on each domain of A(G). Let 
L = max{L(5") : S' £ T}, where T is the finite set of topological types of domains 
of S (i.e., connected non-pants subsurfaces), and L is the constant in Theorem |3. 2 1 
Similarly let Q = max{Q(5") : S' G T}. Choose P > max{L,Q}. Consider the 
following word: 

uP Pu-P P 

w = o 1 a 1 b 1 a\ 

On domains where possibility (i) holds, either a\ and b\ commute and w = a\ p , 
a pseudo-Anosov, or a\ and b\ are independent and w is pseudo-Anosov by Theo- 
rem 3.2 On domains where possibility (ii) holds, w is pseudo-Anosov by Proposi- 
tions 3.3 and 3.4 On domains where possibility (iii) holds, and a\ is the pseudo- 
Anosov, we see by re-writing w as b\ af b^ ■ af that it is the product of powers 
of pseudo-Anosovs, so we may proceed as we did for case (i). Otherwise a\ is re- 
ducible, and for any 7 £ c(ai), dx(bf(j),j) > Pc > Qc > 3, where the leftmost 
inequality employs Corollary |2 . 1 1 1 and the rest are by construction. In particular, 
7 £ cr(ai) and 6f > (7) £ o-(bfaib^ ) together fill the domain, so a\ and bia\bf p are 



sufficiently different pure reducible mapping classes. Then Proposition |3.3| guar- 
antees w is pseudo-Anosov. Thus on all domains of A(G), w is pseudo-Anosov, 
meaning w is full-support for G. 

In terms of {a, b} the word length of w is 4P • (2fc(2n + 1) + 1). Therefore one 
can let I2 = 4P • (80£/c +1). For Theorem |3.1| one may take 

K(S) = 2 ■ |Aut(£Ti(5,Z/3Z))| ■ (320P£/c + 4P) 2 «. 

□ 

3.2. Finding sufficiently different reducibles. The purpose of this section is 
to prove Proposition |3.4| In the hypothesis of the proposition, a and b are pure 
reducible elements of Mod(S"), where S' is a domain in S, and (a, b) is irreducible. 
Given any k > 20/c(5) and n > £(S) - 1, let u = (b k a k ) n b k and v = (a k b k ) n a k . To 
prove the proposition, we show that any choice of a £ a^uau -1 ) and j3 £ cr(w6w _1 ) 
together fill S' . Necessarily a and f3 will bound some domains A £ Aiuau -1 ) and 
B £ A(vbv- X ), and A and B together fill S' . 

Without loss of generality we may assume S' = S. 



We prove Proposition 3.4 in two steps. First, we use Behrstock's Lemma (2.13) 



to follow the image of any a.\ £ o{a) and /3i £ o~(b) under the composition of 
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alternations of high powers of a and b. We find that u(ai) and v(/3±) have a certain 
subsurface projection property. The second step is to prove that this property 
implies the two curves fill S. 

For the purpose of step one, we introduce the overlap graph for a pair of pure 
mapping classes. Suppose a and b are pure nontrivial mapping classes generating 
the group G. Let {Ai}i<i< m and {Sj}i<j<n be the domains of a and b respectively. 
The overlap graph 0(a, b) consists of a vertex for every domain of a that overlaps 
with some domain of b, and for every domain of b that overlaps with some domain 
of a — recall that two domains overlap if the boundary of each domain projects to 
the other. Edges connect domains that overlap, so that one may color the vertices 
depending on whether they represent domains of a or b, obtaining a bipartite graph. 
Assign each edge length one so that 0(a, b) has the usual path metric. 




Figure 5. Example of an overlap graph. 



Note that the foregoing definition works whether or not G contains a pseudo- 
Anosov. Now let us assume that, as in the hypotheses for Proposition |3.4| , G 
does contain a pseudo-Anosov, or equivalently, A(G) is the connected surface 5". 
In particular a and b fix no common curve. If a is not pseudo-Anosov, then any 
domain Ai has essential boundary consisting of curves not fixed by b. This boundary 
must project to some domain Bj of b. If Bj has no essential boundary, then it is all 
of S' and b is pseudo-Anosov. Otherwise, the boundary projects to some domain 
of a. If dBj projects to Ai, then Bj and Ai overlap. If not, then Ai nests in Bj 



by Lemma 2.2 and dBj projects to some other domain A^. In this case, dAk in 
turn projects to Bj, because otherwise Bj nests in A;., implying Ai nests in A/., & 
contradiction. So Bj and Aj~ overlap. In either case 0(a, b) has at least two vertices 
and an edge connecting them. These observations imply 



Lemma 3.5. If A(G) is connected and 0(a,b) is empty, then at least one of a and 
b is pseudo-Anosov. 
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In the case where 0(a, b) is not empty, we distinguish between the domains rep- 
resented by vertices — call these domains overlappers — and those not represented. 
Revisiting the discussion above, we can extract a fact so handy we should name it: 

Lemma 3.6. When neither a nor b are pseudo-Anosov, any non-overlapper of one 
nests in an overlapper of the other. 

That observation helps us to the result below. 

Lemma 3.7. If A(G) is connected, then 0(a,b) is connected. 

Proof. Assuming 0(a,b) is not empty, we show how to find a path between any 
two vertices. Realize the domains as closed submanifolds with minimal pairwise 
intersection (i.e., ensure that boundary curves intersect essentially and use pair- 
wise disjoint submanifolds to represent domains for the same mapping class). For 
any two vertices X and Y of 0(a, b), choose points x and y in the corresponding 
domains, and connect these by a path p. Letting A — \J i Ai and B — (J . Bj, one 
sees that each component of S'\(AUB) is a disk or boundary-parallel annulus with 
a boundary component consisting of pieces of OA U dB. Thus one can isotope p to 
lie entirely within AUB, and furthermore transversal to the boundary curves. The 
path also lies entirely within overlappers, by Lemma [3.6| Tracing the path gives 
a finite sequence of domains, each overlapping with the neighbor before and after, 
alternating between A^s and Bj's by design. The same sequence appears as a path 
in 0(a, b) connecting X and Y. □ 

Now we can state precisely the first step to proving Proposition |3.4| Assume a 



and b are as in Proposition 3.4 with the same definitions as above for A — (J. Ai and 
B = [Jj Bj. Recall c = c(S) is the minimal translation constant from Theorem 



2.11 



and £(S) is surface complexity. 

Lemma 3.8. Suppose a\ and (5\ are curves in cr(a) and o~(b) respectively, and k 
and n are integers such that k > 20/c(S), and n > £,(S) — 1. For a = (b k a k ) n b k (ai) 
and f3 = (a k b k ) n a k ([3i), the following hold for any choice of Ai or Bj. 

(i) If dA projects to Bj, then so does a, and ds - (a, dA) > 14. 

(ii) If dB projects to Ai, then so does j3, and d^ (/?, <9B) > 14. 

Proof. By symmetry we need only prove (i), the case for a. We can use the overlap 
graph to track the image of a± under alternating applications of b k and a k . For 
ease of exposition, we will not always distinguish between overlapper domains and 
their representative vertices in 0(a, b). 

Let Yq be the vertices of 0(a, b) corresponding to the overlappers of b that 
intersect ai; note that Y cannot be empty. Let X Q be the vertices adjacent to io- 
For i E N, let Yi be the vertices adjacent to and let Xi be the vertices adjacent 

to Yi. In other words, Yi corresponds to fe-domains that overlap with a-domains of 
Xi—i, and Xi are the a-domains that overlap with ^-domains of Yi. 

Observe that the vertices of Y i+ i lie within a radius- two neighborhood of Fj. 
Because £,(S), the maximum number of disjoint curves on S, gives an upper bound 
on the number of domains for a mapping class in Mod(S'), twice £,(S) bounds 
the number of vertices in 0(a,b). Because 0(a, b) is connected, one sees that for 
n > — 1, Y n consists of all the ^-vertices, corresponding to all the overlapper 
domains of b. If a domain Bj of b is not an overlapper, then by Lemma 3.6 it nests 
in some domain of a, which precludes projection of dA to Bj. To prove Lemma 3.8 
it suffices to establish the following claim: 
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(*) For any domain Y of Y n , d Y ({b k a k ) n b k ( ai ),dA) > 14 
We induct on n. By definition, oi\ projects to any domain Y of Yq. Applying 



Theorem 2.11 



d Y {b k («i),ai) > c\k\ > 20 > 14 

Because ot\ is a component of the multicurve dA, and because projection distances 
are diameters, we obtain the n — case of (*): 

d Y {b k { ai ),dA) > 14 

Supposing (*) true for n = m, we prove it for n = m + 1. Consider any domain 

Y of F m , and recall that d\am. Y (dA) < 2, by Lemma 2.1 For any A t overlapping 
Y, the triangle inequality gives 

d Y ((b k a k ) m b k (a 1 ),dA i ) > 12 

In particular, (b k a k ) m b k (ai) projects to every Ai that overlaps some Y of Y m ; those 
Ai are exactly the vertices of X m . In other words, for any X € X m , there exists 
F G F TO such that 

d Y ((b k a k ) m b k (a 1 ),dX) > 12 
We may apply Behrstock's lemma, giving 

d x ({b k a k ) m b k ( ai ),dY) < 4 
On the other hand, Theorem |2 . 1 1 1 guarantees 

d^a^^r^ai), (b k a k ) m b k ( ai )) > c\k\ > 20 
Employing the triangle inequality, 

d x (a k (b k a k ) m b k ( ai ),dY) > 16 

The above holds for any Y € Y m , and X 6 X m . Given any y e F m +i, recall that 

V corresponds to a domain overlapping some X £ X m , which in turn overlaps 
with some Y £ Y m . We also know d\an\x{dY U dY') < 2, because dY' and are 
disjoint. Another triangle inequality gives 

d x (a fc (6' £ a fe ) m 6 fe (a 1 ),ay') > 14 

Again we can apply Behrstock's lemma. In fact we can mirror the last four in- 
equalites: 

dy,(a fc (6 fc a fe ) m & fc (ai),«9A) < 4 
d Y ,{b k a k {b k a k ) m b k {a 1 ),a k {b k a k ) m b k {a 1 )) > 20 
d Y ,(b k a k (b k a k ) m b k (a 1 ) 1 dX) > 16 
d Y ,{b k a k (b k a k ) m b k (a 1 ),dX r ) > 14 
where X' is any domain of X m+1 . Note that dX' is a subset of dA, and recall V 
was an arbitrary vertex of Y m+ i. Thus we may re- write the last inequality to give 
the 771 + I step of the induction claim: for any Y G Y m +i, 

d Y ((b k a k ) m+1 b k ( ai ),dA) > 14 

□ 



We are ready for the second step to proving Proposition |3.4| Its own proof will 
probably feel like deja vu. Again, a and b are as in the proposition, with the same 
definitions as above for A — \J i Ai and B = \JjBj. Note that in the following 
lemma, a and j3 are arbitrary. 
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Lemma 3.9. Suppose a, (5 £ Cq(S') satisfy the following: 

(i) If dA projects to Bj, then so does a, and dB D {ex, OA) > 14. 

(ii) If dB projects to Ai, then so does j3, and d^^/?, dB) > 14. 
Then a and j3 fill S' . 

Proof. Given an arbitrary curve 7 £ C(S'), we show it intersects either a or ft. 
Because 7 cannot be fixed by both a and 6, it projects to some domain, and we can 



choose this domain to be an overlapper (Lemma 3.6). Without loss of generality 



we may assume 7 projects to a domain Y of b, which overlaps with a domain X of 



a. Because multicurves have diameter-two projections (Lemma 2.1), properties (i) 
and (ii) imply: 

d Y (a,dX) > 12 
d x (P,dY) > 12 

Assuming 7 does not intersect a, we show it must intersect j3. Again Lemma |2.1 
implies dy(a, 7) < 2. The following inequalities employ the triangle inequality, 
Behrstock's lemma, and another triangle inequality. 

d Y ("f,dX) > 10 

dxh,dY) <4 

dx(l,l3)>8 

In yet another instance of Lemma |2.1| the last inequality shows 7 and /3 must 
intersect. □ 



Proof of Pro pos ition 3.4 Let u — (b k a k ) n a k and v = (a k b k ) n b k . Taken together, 
Lemmas 3.8 and 3.9 show that, for any a G ^(uau^ 1 ) and f3 £ a^vbv" 1 ), a and j3 fill 
S' . Therefore ai = uau^ 1 and bi = vbv" 1 are sufficiently different pure reducible 
mapping classes. □ 

3.3. Constructions. Here we give the pseudo-Anosov construction at the heart of 



the proof of our Main Theorem. That is, we prove Proposition 3.3 of Section 3.1 
which states that, in a group generated by two sufficiently different pure reducible 
mapping classes, every element is pseudo-Anosov except those conjugate to pow- 
ers of the generators. The second part of this section upgrades Proposition |3.3| to 
Proposition |1.2| given in the introduction, by proving the all-pseudo-Anosov sub- 
groups are convex cocompact. 

3.3.1. Of pseudo-Anosovs. Recall that a pseudo-Anosov is characterized by having 
infinite-diameter orbits in the curve complex. Thus we will know w is a pseudo- 
Anosov if distances d(7, ^"(7)) grow as n increases, where 7 is some vertex in C(S). 
Proposition |3.3| is an application of Lemma |3.10| below, which uses the Bounded 



Geodesic Image Theorem (Theorem 2.12) to glean geometric information about 



sequences of geodesies. The author learned this "bootstrapping" strategy from 
Chris Leininger, who takes a similar tack in [Leij . In the same vein one also has 
Proposition 5.2 in [K L08b| . but here the domains are not restricted. 



Let M = M{S) be the constant from Theorem 2.12 Below, properties (ii) and 



(iii) ensure that the projection in (iv) is well-defined. 

Lemma 3.10. Suppose {Yj} is a sequence of domains in S, and {Xj} a sequence 
of subsets ofCo(S), for which the following properties hold for all j: 
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(i) diam(-Xj-) < 2 

(ii) Xj and are pairwise disjoint 

(iii) The set of curves that project trivially to Yj is a subset of Xj 

(iv) dy. (iVj-i, Wj+i) > 2M for any choice ofwj-i £ Xj_i, Wj + i £ Xj+i 

Then for any Wi £ Xi and uii +k G Xi +k , the geodesic [wi,Wi+ k ] contains a vertex 
from Xj for i < j < i + k. Also, the Xj are pairwise disjoint. In particular, 
[to,-, Wi-\-k] has length at least k. 

Proof. First let us induct on k the claim that, for any u>i £ Xi and u>i+ k £ Xi +k , 
the geodesic [u)i, Wi + k] contains a vertex from Xj for j < j < i + k. This is vacuously 
true for k = 1. 

The general induction step works even for k = 2, but let us separate this case 
to highlight its use of the Bounded Geodesic Image Theorem, Theorem |2.12| By 
property (iv), dy i+1 (wi, ttfi+2) > 2M > M, so Theorem 2.12 requires [wi, Wi+2] 
contain a vertex v disjoint from Y^i. By property (iii), v is contained in Xi + \. Of 
course, the endpoints of [wi, Wi+2\ he in Xi and Xi + 2, so the induction claim holds. 

Now assume k > 2 and for any £ Xi,Wi+k—i £ the geodesic 

[uii, Wi+k-i] intersects Xj for i < j < i + k— 1. The main task is to show [uii, u>i+k\ 
intersects Xi + k-i- 

Choosing any uii+k-2 € our first step is to show one can pick a geodesic 

[wi, Wi+k-2] avoiding Xi+k-i- Suppose we are given a geodesic that doesn't, that 
is, for some w' l+k _ l £ X i+ k-i, 

[w h w i+k - 2 ] = [w l ,w' l+k _ 1 ] U [w' i+k _ 1 ,w l+k -2]- 

Let us require w' i+k _ 1 to be the first vertex along [wi, Wi+k-2} belonging to Xi+u-i, 
so that only the last vertex of [u>i, w'i +k _ 1 ] lies in Xi+k-i- The induction hypothesis 
applied to the segment [wi,w' i+k _ 1 ] implies that, for some w' i+k _ 2 £ Xi +k -2, 

[Wi,Wi +k - 2 } = [Wi,w' i+k _ 2 ] U K+/c-2. W -i+fe-l] U K+fc-l,Wi+fe-2]- 

Property (ii) ensures the second and third segments above each have length at least 
one. Thus their union [w' i+k _ 2 ,Wi +k ^2] nas length at least two. Property (i) tells 
us we can replace it with a length-2 geodesic contained entirely in Xi +k -2, giving 
a new [w t ,w t+k - 2 ] avoiding Xj+fe_i. 

Property (iii) ensures that every vertex of [wi, projects nontrivially to 

Yi+k-ii as does Wi+k- Projection distances make sense and Theorem 2.12[ the 
Bounded Geodesic Image Theorem, applies: 

dY t+k - 1 {w i ,w i +k-2) < diamy i+fe _ 1 ([w l , w l+k - 2 }) < M. 
Finally, a triangle inequality: 

dia,m Yi+k _ 1 ([w i ,w. l+k }) > d Yl+k _ 1 {w t ,w l+k ) 

> d Yt+k _ 1 (w 4 +fc-2, Wi+k) - dy.+^i (wi, w i+k - 2 ) 

> 2M — M = M 



Again by Theorem 2.12 we know [wi,u>i+ k ] intersects Xi+k-i- The end of the 



induction is easy: for some u>i+ k -\ in Xi+ k -ii 

[Wi,W i+k ] = [Wi,Wi +k -i] U [w i+k -i,W i+ k]. 

The induction hypothesis says the first segment on the right intersects each Xj, 
i<j<i + k— 1. So the geodesic on the left intersects Xj, i < j < i + k, as 
required. 
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Finally we check that the sets Xj are pairwise disjoint. Suppose z £ Xi D Xi + u 
for some nonzero k. By the part of the lemma already proved, the geodesic [z, z] 
contains vertices in Xj for i < j < i + k: simply put, all those Xj intersect at z. 
But consecutive Xj do not intersect. □ 



Proof of Proposition 3.3, Recall that Q = Q(S) = max{3, (2M + 4)/c} where 



M = M(S), the constant from Theorem 2.12 and c = c(S), the constant from 
Corollary |2.11| The hypothesis states that a and b are pure reducible mapping 
classes, with A g A(a) and B S A(b), and A and B together fill S. We will show 
that for any m,n > Q, (a m , b n ) is free and nontrivial elements of (a m , b n ) are either 
pseudo-Anosov or conjugate to powers of a m or b n . 

If S is a torus or four-punctured sphere, the only pure reducible mapping classes 
are Dehn twists about curves, and any pair of curves fill the surface. In this case 
the result for Q = 3 is due to |HT02j ; see also |Thu881 iPenMl IIsh96j . For the rest 
of the proof let us assume £(S) > 1. 

Let n, m > Q be arbitrary integers. Let a be a component of dA and (3 of dB. Let 
C a C Co (S) be the vertices with empty projection to A, and define Cb analogously. 
Observe that Ca and Cb sit in 1-neighbor hoods of a and f3, respectively, so they 
each have diameter 2 in C(S). Because any curve intersects either A or B, Ca and 
Cb contain no common vertices — this is the only place we use the fact that A and 
B fill S. 

Here's the whole point of our choice of Q: for any nonzero integer k, 

d A (CB,a mk (C B )) > d A {p, a mfc (/3)) > \mk\c > Qc> 2M + 4 

d B {CA,b nk {C A )) > d B {a,b nk (a)) > \nk\c> Qc > 2M + 4 
Notice that the domains A,B and sets Ca,Cb play the same roles for a m and b n 
as they do for a and b. To ease the burden of excessive exponents, we replace a m 
with a and b n with 6 for the remainder of the proof. In this new notation, our goal 
is to show that (a, b) is free, and every nontrivial element of (a, 6) is pseudo-Anosov 
except those conjugate to powers of a or b. We have already established that, for 
any nonzero fc, 

d A {C B ,a k (C B ))>2M + 4 and d B {C A , b k (C A )) > 2M + 4 
In particular, for any 7,7' 6 Cb or 5, 5' € Ca, 
(1) d A {l 1 a k {i))>2M and d B (6,b k (5')) > 2M 



We use this shortly to apply Lemma 3.10 



In the abstract free group on a and b, a word w has reduced form w — S1S2 . . . sr, 
where the syllables Sj are nontrivial powers of either a or b, and Sj is a power of a if 
and only if Si±i is a power of 6 (i.e. powers of a and b alternate). Define powerblind 
word length | • |* by |iu|„ = R, the number of syllables of w. 

Claim. For any word w, either d(w(a),a) > \w\„ or d(w((3),f3) > \w\*. 

It immediately follows that (a, b) is a rank-two free group. If is even, it is 
easy to check |u> n |* = n|iy|*. The claim implies d{w n {^f), 7) > n|ttf|*, where 7 is a 
or j3, depending on 10. In particular the orbit of w has infinite diameter in the curve 
complex, so w is a pseudo-Anosov. If is odd and neither conjugate to a power 
of a nor of 6, then it is conjugate to v such that \v\* is even. As v is pseudo-Anosov, 
so is its conjugate w. 

It remains to prove the claim. Towards this, we describe a sequence of domains 
and C (S')-subsets fulfilling the hypotheses of Lemma 3.10 Let w be the identity, 
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u>i = si, u>2 = S1S2, and so forth, so that Wi is the word formed by the first i 
syllables of w. If si is a power of a, let 1(a) correspond to the even integers, and 
1(b) to the odds; if Si is a power of b, switch the roles of even and odd. Define 
-R-length sequences of vertices 7 r , domains Y r , and sets X r as follows. 

7 r = w r (a) Y r = w r (A) X r = w r (C A ) W G 1(a) H [0, R] 

j r = w r (P) Y r = w r (B) X r = w r (C B ) Vr G 1(b) (1 [0, i?] 

In addition, set {7-1, Y_i,X_i} equal to {/3,-B,(7b}, if si is a power of a, or 
{a, A, Ca}, if si is a power of b. 

Each Xj is isometric to Ca or Cb, and furthermore pairs X,-, Xj+i are isometric 
to the pair Ca,Cb, disregarding order. Therefore the sequence {Yj,Xj} meets 
conditions (i) - (hi) of Lemma |3.10| Condition (iv) requires d-j^. (Vj-i, Vj+i) > 2M 
for any choice of Vj-% G Vj+i G -^i+i. For j = 0, this condition simply 

restates one of the inequalities in ([!]) above, so let us suppose j > 1. Without loss 
of generality, assume Y, = tu?(A) and Vj-i = Wj-i^-), Vj+i = Wj + i(^ + ) for some 
7_,7 + G C B . Because subsurface projection commutes naturally with the action 
of the mapping class group, 

d Wj (A)(Vj-i,V:j + i) = A A (wj 1 (v J -i),wj 1 (v 3+ i)) = d A (sj 1 (j-), s J+1 (j + )) 

Exactly one of Sj and Sj+i is a power of a, while the other is a power of b. In any 
case one knows that, for some 7,7' G C B and nonzero k, 

d A (sJ 1 (^), Sj+1 h + )) = d A (a k ( 1 ), 1 ') 

We used the fact that b fixes Cb setwise. Applying inequality above, we can 
conclude d w .( A )(vj-i,v ]+ i) > 2M. 




Figure 6. The sequence defined in the proof of Proposition 3.3 Here 
to = ab 2 a 3 6 4 a 5 . Vertices correspond to ji and shaded circles represent 
Xi, which each sit in the 1-neighborhood of 7;. The heavy, smooth line 
is a geodesic connecting /3 and to(/3); it intersects each of the Xj. 



By Lemma 3.10 the geodesies [7-1, Jr] and [70, Jr] have lengths at least R+l 
and R respectively. Depending on w, one of these geodesies is either [a,u>(a)] or 
[f3,w(f3)]. This proves the claim, thus the lemma. □ 



Remark. One can ask whether Proposition 3.3 above can be proven for n-tuples 
of pure mapping classes subject to appropriate conditions. For example, if pairwise 
they fulfill the requirements for a and b in the lemma, do sufficiently high powers 
generate rank n free groups? One would need to avoid the trivial counterexamples 
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arising from redundant generating sets such as {a, b, a k ba~ k }. Excluding this latter 
possibility, the immediate adaptation of the proof of Lemma |3.3| for n-tuples only 
gives a Q that depends on the particular n-tuple; proving that some Q works for 
any n-tuple seems to require more maneuvering. 

3.3.2. Of convex cocompact all-pseudo-Anosov subgroups. Now we upgrade Proposi- 
tion 3.3 to Proposition 1.2 of the introduction. The action of a group G < Mod(S') 
on the curve complex gives a quasi-isometric embedding G <->• C(S) if, for some 
7 e Ca(S), K > 1, and C > 0, and for all w € G, 

(2) K\w\ +C > d(7,w( 7 )) > \w\/K- C 

where \w\ gives word length with respect to some metric on G. From ([2]) one sees 
that such a group contains no non-trivial reducible elements. Let us call a group 
convex cocompact if it fulfills ([2| for some 7, K, and C as above. Hamenstadt and 
Kent-Leininger proved that this is equivalent to the definition of convex cocompact 
mapping class subgroups introduced by Farb and Mosher in [FMj . Convex cocom- 
pact subgroups which are free of finite rank are called Schottky. Farb and Mosher 
proved these are common in mapping class groups of closed surfaces, and Kent and 
Lcininger include a new proof which also works for non-closed surfaces: 

Theorem 3.11 (Abundance of Schottky groups [FMj . |KL08a| ). Given a finite 
set of pseudo- Anosovs {(71,52, ■ ■■9k\ which are independent (i.e., no pairs of powers 
commute), there exists I such that for all m > I, {g™, g™ , •■•<7™} is Schottky. 

Fujiwara found a uniform bound for the above theorem in the case of two- 
generator subgroups: 

Theorem 3.12 (Fujiwara |Fuj09| , full version). There exists a constant L = L(S) 
with the following property. Suppose a, b G Mod(5) are independent pseudo- Anosov 
elements. Then for any n,m> L, (a",6 m ) is Schottky. 



Proposition |1 ,2| provides a source of Schottky subgroups with arbitrary rank. It 
simply adds to Proposition |3.3| the claim that finitely generated all-pseudo-Anosov 
subgroups of (a, b) are Schottky. Recall that a and b are pure mapping classes with 
essential reduction curves a and /3 respectively, such that a U /3 fills S. Proposi- 
tion 3.3 tells us that G = (a, b) is a free group, and its all-pseudo-Anosov subgroups 
are exactly those containing no conjugates of powers of a or b. For example, any 
subgroup of the commutator group [G, G] qualifies. 



Proof of Proposition 1.2 Let G = (a, b) as in the proposition, and let H be a 
finitely generated, all-pseudo-Anosov subgroup of G. To show that H is Schottky, 
we need quasi-isometry constants for the inequalities in ([2]). For an orbit embedding 
of a finitely generated group, the upper bound of ([2| always comes for free, using 
any K greater than the largest distance some generator translates 7. So our work 
is the lower bound. In the proof of Lemma [3~3] we saw that d(7, iu( 7 )) > \w\* 
where 7 is one of a or j3, depending on w. It is not hard to check that, in general, 
d(a,u>(a)) > \w\* — 1 (the same is true replacing a with fj). However, powerblind 
word length is not a word metric with respect to any finite generating set for H. 
In what follows, we define a convenient generating set for H such that, letting | • \h 
denote the corresponding word metric, we have > where K is the 

size of this generating set. Then we can conclude that d(a, w(a)) > \w\h/K — 1, 
completing the proof. 
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The existence of this convenient generating set has no relation to our setting of 
subgroups of mapping class groups, so we isolate this fact as a separate technical 
lemma. We only need that G is a rank two free group generated by a and b. Suppose 
H is a finitely generated subgroup. For w € H, let \w\ denote its length in G with 
respect to the generating set {a, b}. Let Hi = {h e H : \h\ < 1} and choose L so 
that Hl generates H. Define | • \h as word length in H with respect to H^. Let K 
be the number of elements in H^. 

Lemma 3.13. If H contains no element conjugate to a power of a or b, then 
K\wU > \w\ H - 

Proof. Suppose w £ H has length \w\h = n, and w — hih 2 ---h n , where hi G 
Hl. One knows that > \hi\, because otherwise hihi + \ £ Hl, which 

contradicts that \w\h — n (one could take a shorter path to w in the Cay ley graph 
of H with respect to Hl, by replacing hi and hi + i with their product, another 
generator). In particular this means that strictly less than half of the word hi, 
written as a product of a's and &'s, gets canceled by a piece of the word h i+ i in a's 
and 6's. Likewise, strictly less than half gets canceled by a piece of the word hi-i. 
Therefore, at least the middle letter of hi, if \hi\ is odd (the middle pair of letters 
if \hi\ is even) gets contributed to the {a, 6}-spelling of the word w. Incidentally, 
this is showing that \w\ > \w\h, implying that finitely generated subgroups of the 
rank-two free group are quasi-isometrically embedded. 

Call the middle letter or pair of letters of each hi its core. Powerblind word length 
can be shorter than \w\h only if a string of consecutive hi's, say, hihi + \ ■ ■ ■ hk, 
all have a or a 2 at their core, or if they all have b or b 2 at their core, and these 
cores contribute to the same syllable (power of a or b) in the {a, 6}-spelling of w. 
In that case one can write, for i < j < k, 

hj = uj ■ x e ^ ■ Vj 

where x is a or b and x e ^' includes the core of each hj. Furthermore, Vj = uj^ ± , 
so that 

ft-A+i ■■■hk = UiX N v k 
where N = J2i<j<k e 0)- ^ is possible Ui or v k are empty words, but N cannot be 
zero, because otherwise \hih i+ i ■ ■ ■ h k \ < \hi\/2 + \h k \/2 < L, meaning the entire 
string can be replaced with a single element of Hl, contradicting the fact that 
\w\h = n. In this context, suppose hi = h k . Then u k = UiX p for some p. But 
because v k -i = , one has 

hi- - ■ frfc-i = u { x N uT 1 

for N' = N — e(k) — p. However, the stipulation that H contains no elements of 
G conjugate to powers of a or b, precludes this scenario. Thus if any consecutive 
string hi ■ ■ ■ h k in the _ff£-spelling of w contributes to the same syllable in the 
{a, &}-spelling of w, that string includes at most one instance of each element of 
H L . 

We have demonstrated a correspondence between letters hi and syllables of w 
written with respect to Hj, and {a, b} respectively: each letter corresponds to 
at least one syllable (the one in which its core appears), and at most K letters 
correspond to the same syllable. Therefore i4T|w|* > \w\h- O 



As described above, Lemma [3. 13 completes the proof of Proposition L2 □ 
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